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Abstract 

A recent result of Ding, Lee and Peres expresses the cover time of the random 
walk on a graph in terms of generic chaining for the commute distance. Their 
proof is very involved and the purpose of this article is to present a simpler ap- 
proach to this problem based on elementary hitting times estimates and chaining 
arguments. Unfortunately we fail to recover their full result, but not by much. 

1 Introduction 

Let {Xn)n>o be an irreducible Markov chain on some state space M. Given A C M 
let 

T{A) = inf{n > 0: X„ e ^} 
be the first time the chain hits A and let 

T,ov{A) = snpT{x) 

be the first time the chain X has visited every point of A. The cover time of A is by 
definition 

cov(A) = sup(E:rTcot,(A)). 

To avoid trivial situations, the chain is assumed to be positive recurrent throughout so 
that cov(A) < -|-cx) if and only if A is finite. 

Using the strong Markov property it is easily seen that given x, y, z in M 

E,T{y) + EyT{z) 

is the expectation (under P^) of the first time the chain has visited y and z (in this 
order). This implies that 

E^T{y) + EyT{z) > E^T{z). 

Therefore the commute time 

d{x, y) = Ej; r(y) + T{z) 
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is a distance on M. This article deals with the following problem, dating back to 
Matthews' article [6] at least: can cav{A) be estimated in terms of metric properties of 
{A, d)l An arguably definitive answer to this question has recently be given by Ding 
Lee and Peres [3], their result is expressed in terms of generic chaining. 



The generic chaining 

The generic chaining is a tool designed by Talagrand to estimate suprema of Gaussian 
processes. Let us describe it briefly and refer to the book |7j for details. 
Throughout we let (iV„)„>o be the following sequence of integers: 

7Vo = l, A^«-22",n>l. (1) 

Given a set M, a sequence {An)n>Q of partitions of M is called admissible if An+i is a 
refinement of An and if \An\ < Nn for every n > 0, where \An\ is just the cardinality 
of An- The cardinality condition implies in particular that Aq = {A/}. Given a 
sequence of partition {An)n>Q of M and x e M we let An{x) be the only element of 
An containing x. 

Definition 1. Let {M,d) be a metric space. Set 

72(Af,d) = inf [sup (y2"^^A{An{x),d) 

where the infimum is taken over all admissible partitions {An)n>a of M, and A{A,d) 
denotes the diameter of A. 

Recall that a Gaussian process is a family {T{x))xeM of random variables such 
that every linear combination of the variables T{x) is Gaussian. The process is said to 
be centered if E T(x) = for every x. The fundamental result of Talagrand reads as 
follows. 

Theorem 2. Let {T{x))xeM be a centered Gaussian process. Then 

Y72 (M, d)<E sup r(x) < L72 (A/, d) (2) 

where L is a universal constant and d is the following distance on M 

d{x,y) = ^£{T{x)~T{y)Y. (3) 

The upper bound is not specific to Gaussian processes, it applies to any centered 
process {T{x))x^m satisfying 

P(r(a;) - T{y) > u) < e-"'/2<i(^,y)^ (4) 

for all x,y € M, for all m > and for some distance d. Using a union bound it is not 
hard to see that a centered process satisfying (HJ satisfy 



E sup T{x) < CVlogpJ max d{x, y), (5) 
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for every finite subset A of M. The proof of the upper bound of ^ consists in applying 
this union bound repeatedly and at different scales. 

The lower bound is another story, it is specific to Gaussian processes and much more 
difficult to prove. The key estimate is the Sudakov inequality: if {T{x))xeM is a 
centered Gaussian process then for all finite subset A oi M 

EsupT(a;) > cv/iogp] min d{x,y) (6) 

xeA x=iyeA 

where c is a universal constant and d is the L^-distance ([3]). 



The Ding, Lee and Peres theorem 

Cover times satisfy inequalities analogue to ([S]) and ^ due to Matthews 0: for any 
finite subset A of M 

cov(A) < (l + log|A|) max (E,T(y)) 
cov(^) > log|A| min (E,T(y)). 

x=iy€A^ 

In view of these inequalities it seems natural to conjecture that the correct order of 
magnitude for cov{A) is 

+ 00 



7i(Ad) =infUp(V2"A(A.(a;),d) 



rather than 72(^7 d) (recall that d is the commute distance d{x, y) — Ex T{y) + Ey T{z)). 
This is not quite correct. Here is the result of Ding, Lee and Peres. 

Theorem 3. // the Markov chain X is reversible (and if the state space M is finite) 
then 

y[j2{M,Vd)]^ < cov(M) < L[-f2{M,Vd)f 
for some universal constant L. 

Remark. Actually the inequality remains valid when M is infinite. Indeed since 
d{x, 2/) > 1 when x =1 y, we then have 72 (M, v^) — +00. 

The correct order of magnitude 72(M, -s/rf)^ is comparable to our wrong guess: 
clearly 

7i(M,d) < [72(M,Vd)]'. 



Purpose of the present article 

The proof of Ding, Lee and Peres is very involved. In particular it relies on the Ray- 
Knight isomorphism theorem which makes a connection between local times of the 
chain and the Gaussian free field associated to the chain. It may be interesting to have 
a simpler proof relying only on elementary hitting times estimates and on Talagrand's 
generic chaining. The purpose of this article is to provide such a proof. 
Unfortunately we fail to recover entirely Theorem [3J here is what we prove. 
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Theorem 4. // X is irreducible and positive recurrent, then 

cov{M) < L[j2(M,Vd)Y. (8) 
for some universal L. More generally we have 

cov{A)<L[j2{A,Vd)]\ (9) 

for every subset A of M . 

Inequality (|8]) is slightly stronger than the upper bound of Theoreni[3]since the chain 
is no longer assumed to be reversible. Besides, it is not clear whether the approach of 
Ding, Lee and Peres yields (|9]). 

Theorem 5. If in addition the chain X is reversible then 

7i(Af,d) < Lcov(M), (10) 
where L is a universal constant. Again we actually have 

ji{A,d) <Lcov{A), 

for every A (Z M . 

Remark. The reversibility assumption is necessary. Indeed, consider the discrete torus 
Zjv and the Markov kernel given by 

P{x,x + 1) = 1, VxGZat. 

Clearly d(x, y) = N for all x ^ y, which implies that 

7i(r,d)«iviog(7V). 

On the other hand TcovC^n) = N p.s. (whatever the starting point). 

Since 7i(M, d) < [72(M, -\/d)]^ inequality ([T0|) is weaker than the lower bound of 
Theorem [3l Let us comment a little bit more on this. In order to compute 71 (A/, d) 
one can restrict to partitions {An)n>Q satisfying 

An = {{x}, xeM} 

for n > k, where k is the only integer satisfying 

Nk-i < \M\ < Nk. 

Then by convexity we get 

00 ^ k 2 

(5^2"/VA(A„(x),d)) = (^2"/VA(A„(a;),d)) 

00 

< (fc + l)5]2"A(A„(a;),d) 

Tt = 

for every x £ M , yielding 

[72 (Af, Vrf)]' < Clog(log|Af|)7i(A//,d) 

for some universal C (provided \AI\ > 3). Therefore the estimate (|10p is off the correct 
order of magnitude by at most a factor log(log| Af|). This is sharp, there is a Markov 
chain for which the gap is indeed log(log|Af|) (see the appendix). 
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2 The upper bound 

Since {Xn)n>o is an irreducible, positive recurrent Markov chain, there is a unique 
invariant probabihty measure which we denote by tt. The purpose of this section is to 
bound 

E sup T{x) 

through a chaining argument. Since no estimate such as (jj]) is available for hitting 
times, the chaining procedure will be different from Talagrand's, and is taken from the 
articles Hg]. 

We need a couple of additional notations. Recall that 

T{x) = inf (n > 0, X„ = x) 

is the hitting time of x. Let 

T^{x) = inf(n > 1, X„ = x) 
be the first return time to x and for k > 2 define inductively the fc-th return time to x 

by 

T''{x) = inf(n > T''-\x) + 1, X„ = x). 
We also let T'^{x) = by convention. Lastly, let 

n=0 

be the empirical measure of the chain X . In other words Nk{x) is the number of visits 
to X before time k. 

The following deviation estimate is due to Kahn, Kim, Lovasz and Vu [J. 
Lemma 6. Let x ^ y in M . Then for every e > and for every integer k 



< cxp 



{x) J I A'K{x)d{x,y) 

Let us sketch the argument. Because of the strong Markov property, under P^: the 
variables 

(^T.(x)(2/)-A^T-i(x)(2/)),>i 

are independent and identically distributed. And it is a standard fact (see for in- 
stance [U chapter 2]) that their law is geometric: for every integer r 

Px {Nt1(x) (y) > f ) = Pxy{l - PyxY, 

where 

Pxy = PxiT{y) < T\x)) ^ 



ry ■ XV- va; - v-yy TT{x)dix,y) ' 

The previous lemma is thus a Hoeffding type estimate for sums of independent geo- 
metric variables. We refer to [1] for the details. 
Our next tool is taken from Barlow, Ding, Nachmias and Peres [2]. 
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Lemma 7. Let A be a finite subset of M , let z ^ A and let k be an integer. Then 



Ez Tcov {A) < 



k 



7r{z)Pz{T,ov{A)<Tk{z)) 
Again we sketch the proof and refer to [2] for details. Let 

iV-inf(n> 1, T,„„(A) < r"'=(z)). 

Then by Wald's identity 

E.Teo,(A) < EzT^\z) - E,(iV) E,T\z). 

On the other hand if N is larger than n then the walk fails to cover A during any of 
the following intervals of time 

[0,T'=(z)), [T\z),T^\z)),..., [T("-i)'=(z),r"'=(0)) 

so that 

PAN > n) < Pz{T,ov{A) > T\z)Y. 

The result follows. 

The authors of P| combine these two lemmas with a nice chaining argument. Although 
it is not written this way, their result is essentially the Dudley version of Theorem |4j 

oo 

cov(M) < Li^ e„(M, v^)2"/2)^ (11) 

n=0 

where 

e„(M, Vd) = inf(sup d{x, A)) 

where the infimum is taken over all subsets A of M satisfying | A| < iV„. This is weaker 
than Theorem HI Indeed swapping the sup and the sum in the definition of 72 , it is 
easily seen that 

00 

72 (T, Vrf) <C^e„(A/,yd)2"/2, 

Tl = 

for some universal constant C. We show that it is possible to modify BDLP's chaining 
argument to obtain Theorem 21 

Let z,x,y in M such that x ^ y and let fc, / be two integers larger than 1. Observe 
that 

Pz{T\y) > T\x)) - P^Nt^^xM <l-l) 

< PANT^t^x){v)-NTHx){y)<l-l) 
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The latest equality being a consequence of the strong Markov property, li {l—l)/7T{y) < 
{k — 1)/tt{x), applying Lemma [6] to k — 1,1 — 1 and 



e = 1 



(fc - l)n{y) 



gives 



P,(r'(y) >T'=(a;)) <exp 



(12) 



This will be our key estimate. Lastly, we shall use the following elementary fact: if x 
and y are distinct elements of M then 
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= T\x) < E, T{y) + Ey T{x) = d{x, y). 



tt{x) 

Let us reformulate Theorem |4l 

Proposition 8. Let A C M, let z & A and let {An)n>o be an admissible sequence of 
partitions of A. Then 



^ATcoviA)) < L(sup 2"/VA(A„(x))j . 

Recall that A„(x) denotes the only element of An containing x. Also IS. denotes the 
diameter with respect to the commute distance. 

Proof. Let tQ,{A) = z, and for each n and for each B £ An let tn{B) be an arbitrary 
element of B. Given x £ A, we let x„ = i„(yl„(x)). We can assume that A is finite 
and that 

An = {{a;}, x£A} 

for n large enough (the right hand of the desired inequality equals +oo otherwise). 
Therefore x„ = x eventually. Let 

+ 00 



r„(x)= sup ^2'=/VA(A,(y)) 



and 



kn{x) = [34 • 7r(x„)r„(x)ro(x)J + 1, 

where [rj denotes the integer part of r. Observe that r„(a;) and kn{x) depend only on 
An{x). In particular fco(a;) depends on nothing. Also 

rn{x) ~ rn+i{x) > 2"/VA(A„(x)) 



> 2"/V<2:„,x„+i) 

We claim that for every x and n 

P.(T'="+i(-)(x„+i) >T'="(-)(a;„)) < 



e-2"- < 



n+3 



(13) 
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Indeed, if x„ = Xn+i then kn-f-i{x) < kn{x) and the inequality is trivial. Otherwise 
write 

fc„(x) - 1 fc„+i(x) - 1 ^ 34 . _ 1 



7r(a;„) 7r(a;„+i) 7r(x„) 

> 34 • 2"/^^d{xn,Xn+i)ro{x) - — 

7r(a;„) 



Since Xn ^ a:„_|_i and d{xn, Xn+i) < 7'o(a^) we have 

< y/d{xn,Xn+i)ro{x). 



n{xn) 
Therefore 

- > (34 . 2-/2 - l)Vd(x„,.x„+i)^o(a;) 

7r(a;„) 7r(a;„+i) 

i)?'o(a;). 

Also 

^"^f^ T ^ < 34 . r„(x)ro(a;) < 34 • ro{x)\ 
7r(x„) 

Since 33^/(4 • 34) > 2^, combining (IT^ with the last two inequalities yields (|13l) . 
The number of possible couples (a;„, is at most NnNn+i. Recall the definition ^ 

of Nn and observe that < Nn+i for all n. A union bound shows that the probability 
that there exists x and n such that 

is at most 

iV„+3 - iV„+2 ~ ^ 8 ■ 

n>0 n>0 + n>4 

Therefore with probability at least 7/8, we have 

T'="+i(-)(a;„+i) <r'="(^)(a:„) 

for all a; and n, hence 

T'^-"(^)(a;„) < r'=«(^)(a;o) = r'="(z). 
Since a;„ = x for n large enough and fc„(a;) > 1 we obtain 

Vx g ^, T(a;) < r'="(z) 
with probability 7/8 at least. In other words 

Pz{Tcov{A)<T''°{z))>l. 
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Together with Lemma [7] we get 

E.T_(^)<-^<|(34.r^ + 4^). 

7tt(z) 7 n(z) 

Unless A = {z}, in which case cov{A) = and there is nothing to prove, we have 
l/Tr{z) < A{A) < rl- Therefore 

• 35 



Tcov{A) < ^(sup V 2"/VA(A„(x)) . □ 



3 The lower bound 

First let us slightly modify the definition of cover time: given A C M let 

cov_(A) = min TcoviA) 

xeA 

cov+(A) = maxEa;Tcoi,(A) 
In this section we prove the following 

Proposition 9. Let X be an irreducible, positive recurrent Markov chain on a discrete 
state space M . If the chain is reversible then for every finite subset A of M 

7i(Ad) < £(cov_(A) + A(A,d)), 

where L is a universal constant. 

Remarks. 1. This yields Theorem [5] since clearly 

cov_(A) < cov+(yl) 
A{A,d) < cov+(yl). 

2. The term A{A, d) cannot be removed from the inequality. Indeed if M = {0, 1} 
and the transitions are given by the matrix 

e 1 - e 
1 - e 

then 

7i(M,d) > A(M,d) ^ 



eil-e) 

whereas cov_(Af) = min(i, jzr^)- 
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Talagrand's growth condition 

Recall the majoring measure theorem: il {T{x))xeM is a centered Gaussian process 
then 

72(r,d) < LE sup T{x), 

where d is the distance ([3]). The proof of Talagrand consists in showing (using 
Sudakov's inequahty) that the functional 

^ E sup T{x) 

satisfies an abstract growth condition, and that such functionals dominate 72. Here is 
the definition of the growth condition adapted to the 71 situation (rather than 72). 

Definition 10 (Growth condition). Let {M,d) be a metric space. A functional 
F: 'P{M) — > M+ is said to satisfy the growth condition with parameters r > 1 and 
T G N if for every step n G N and every scale a > the followings holds. Let 
m — Nn+Ti for every sequence Hi, . . . , Hm of non-empty subsets of M satisfying 

1. A{\J^<■mH^) < ra, 

2. d{m,Hj) > a for all i ^ j, 

3. A{H,) < a/r for aU i, 
we have 

F(U<™iJ.) > a2" +mini^(i?,0- 

Theorem 11. If F is a non- decreasing for the inclusion and satisfies the growth con- 
dition with parameters r and r then 

7i(M,d) < L2^{A{M,d)+rF{M)), 

where L is a universal constant. 

We refer to [7] for a proof of this theorem. The purpose of the rest of this section 
is to show that the functional 

A i-> cov-{A) 

is non-decreasing and satisfies the growth condition on (M, d) (where d is the commute 
distance) with universal parameters r and r. 

Lemma 12. The functional A 1— > cov-{A) is non- decreasing for the inclusion. 

Proof. We use the strong Markov property. The shift operator is denoted by a: for 
every integer k 

Let A C B and let x E B. Then 

^COV 

[B) > T{A) + Tcov{A) o ax{A)- 
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In words: at time T{A) the chain has yet to visit every point of A\{Xt(a)}- By the 
strong Markov property 

Tcov{B) > E, T{A) + E, [Exr{A) Tcov{A)] . 

> E^T(A) +cov_(A) 

> cov_(74), 

which is the resuh. □ 

Variations on Matthews' bound 

The following is due to Matthews |:6 . 

Lemma 13. Let A be a finite subset of M , let a > and assume that T{y) > a for 
every x ^ y in A. Then 

\A\-1 ^ 

cov_(A) >a J2 ^ >alog(|A|). 

k=l 

Proof. Let x ^ A. Assuming that \A\ > 2 (otherwise the result is trivial) we have 

P4T,oviA)^T{y)) = l. 

So there exists y & A such that 

P4T,o4A)=T{y))>jj^. (14) 

Let A' = A\{y}, let S = TcoviA') and let T = TcoviA). Clearly 

S+ {T{y) o as)l{s<T{v)}- 

By the strong Markov property 

E^ T = E:r + E:, [{Exs T{y))l{s<Tiy)}] ■ 

On the event {S < T{y)} the point Xs is an element of A different from y. Therefore 
Exs T{y) > a. Together with (HH) we obtain 

Ex Tcov 

{A') 



\A\~1 

An obvious induction on \A\ finishes the proof. □ 

The following lemma is proved the same way. 
Lemma 14. Let Hi, . . . ,Hm be non-empty subsets of M satisfying 

E. T{y) > ay{x, y) e H, x H^, \fi ^ j. 
Then for all x G Ui<,„i?i 

ExTiia,yiT{Hi) > alog(m). 



11 



An additional application of the strong Markov property yields the following re- 
finement of the previous lemma. 

Proposition 15. Let Hi, . . . ,H„i be non-empty subsets of AI satisfying E^Ty >a for 
all (z, y) Cz Hi X Hj, for all i ^ j . Then 

cov_(l I Hi) > alog(m) + min cov_ (15) 

Proof Let x G U,<„ff,. Let S = max,<„, T{H,) and T = T,,,{U,<^H,). If S = T(H,) 
then at time S the chain has yet to visit every point of Hi\{Xs}. Therefore 

m 

T> S + ^{Tcov{Hi) o as)l{s=T(Hi}} 
Using the strong Markov property, we get 

m 

2=1 

> Ea; + mincov_(_ffj). 

Together with the previous lemma we get the result. □ 

We are close to desired growth condition. We would like to obtain the inequality 
under the weaker hypothesis 

d{x, y) = Ex T{y) + Ey T{x) > a, Vx, y ^ HiX Hj,i ^ j. 

This is done in the next section. Roughly speaking, reversibility insures that for a 
reasonable proportion of x and y the hitting times E^ T{y) and Ej^ T{x) are of the same 
of order of magnitude. 

Reversibility 

Again this part of the argument is taken from Kahn, Kim, Lovasz and Vu's article [1]. 
We start with a simple lemma concerning directed graphs. Given a directed graph 
G = (V,E), a path of G is a sequence xi, . . . , Xm of vertices satisfying (xi, Xi+i) G E 
for i < m. The length of such a path is defined to be m. An independent set is a 
subset A of V satisfying {x, y) ^ E for all x, y in A. 

Lemma 16. // every path of G has length at most m then G has an independent set 
of cardinality at least \V\/m. 

This is standard, but we still sketch the argument. It is easy to show by induction 
on rn that G is then m-colorable: it is possible to map the vertices of G to {1, ... , m} 
in such a way that connected points have different images. Then by the pigeon hole 
principle, at least |y|/m vertices have the same image, which is the result. 
From now on the chain (X„)„>o is assumed to be reversible. Consequently, we have 
the following commuting property for hitting times. 
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Lemma 17. For every sequence xi,. . . ,Xm of elements of M we have 

Exi T{X2) H h ^x^-i T{xm) + ^x„, T{xi) ^^^^ 

= Ext T{Xm) + T(Xm-l) H h E^^^ T{xi). 

We refer to [51 Lemma 10.10] for a proof. 

Corollary 18. Let A he a subset of M and a > 0. // A(A,d) < 16a and ifd{x,y) > a 
for all X ^ y in A then there exists a subset A' of A satisfying 

• \A'\ > \A\/33. 

• Ex T{y) > a/4 for all x ^ y in A' . 

Proof. We define a graph G with vertex set A by saying that the edge [x, y) is present 
ii X ^ y and E^; T[y) < a/4. Let xi, . . . , Xm be a path of G. Then the inequahties 

Exi T{xi+i) < a/4 
T{x,) > 3a/4 

and equation give 

(m — l)a 3(m — l)a 

h Ex„^ T{xi) > h Ext T[x„i). 

Together with the bound on the diameter of A we obtain m — 1 < 32. Therefore G has 
an independent set of cardinality at least |A|/33. This is our set A'. □ 

The growth condition for the cover time 

Proposition 19. The functional A i— )• cov_ (A) satisfies the growth condition with 
parameters r = 16 and t — 5. 

Proof. Let n e N, let a > and m — A^n+s- Let Hi, ... , Hm satisfy 

1. A(U,<™iJ,) < 16a. 

2. d{H„Hj) > a, for all i ^ j. 

3. A(i7,) < a/16 for all i < m. 

Let xi, . . . , Xm belong to Hi, . . . , Hm respectively. By the first two properties and 
Corollary [TBI there exists a subset / of {1, . . . , m} satisfying 

• |/| > m/33. 

• Exi T{xj) > a/4 for every i ^ j in I. 
Let i ^ j in I and let {x, y) E Hi x Hj. Then 

E. T{y) > Ex, T{x,) - Ex, T{x) - E, T{x,) >^-^-^ = ^. 
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Proposition (TS] gives 

cov^{[J Hi) > ^log(|/|) + mincov_(fl",). 

Since 

|/| > iV„+5/33 > iV„+5/A^3 > > e*-2" 

we obtain 

cov_( I J Hi) > aT + mincov_(7J.,), 

which is the result. □ 

Then, by Theorem I 111 we obtain 

7i(Af,d) < i(cov_(M) + A(M,d)). 

Obviously we can replace M by any subset A of M in this inequality: if a functional 
F satisfy the growth condition on (M, d) then it also satisfies it on (A, d). 



Appendix 

We have seen in the introduction that for any metric space (M, d) 

[72(A/, Vd)]'< Clog(log|M|)7i(M,d). (17) 

We show in this appendix that this is sharp and that the example saturating the 
inequality can be chosen to be the state space of a reversible Markov chain equipped 
with the commute distance. The example is taken from [1^ and was pointed out to the 
author by James Lee. 

Let M be a rooted tree of depth D (large enough) satisfying 

• nodes at depth i <D ~ \ have A^^ + 1 children, 

• edges between depth i and depth i + 1 have multiplicity 2*, 

and let X be the random walk on this graph. The probability measure defined by 
7r(a;) = d{x)/2E for every x, where d{x) is the number of edges (counted with mul- 
tiplicity) starting from x and E is the total number of edges, is reversible. Let us 
compute the commute distance d. Because of the tree structure it is easily seen that 

n-l 

d{x,y) ^'^d{x^,Xi+i) (18) 

2 = 

where xq, . . . , a;„ is the shortest path from x to y. Therefore it is enough to compute 
d{x, y) when x and y are neighbors, in which case we use the formula (see [T]) 
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Because of the tree structure again Px{T{y) < T^{x)) is just the transition probabihty 
from X to y. We obtain 

y)^2E- 2-' 

when (a;, y) is an edge between depth i and depth i + 1. When x and y are any two 
nodes of M, equahty ()18|) then imphes that 

E -2-'+^ < d{x,y) < E -2-'+^ (19) 
where i is the depth of their closest common ancestor. 
Proposition 20. There is a universal constant C such that 

D ■ E 



C 

d-Ve 



<li{M,d) <C ■ D ■ E (20) 
<-/2iM,Vd)<C-D-VE. (21) 



C 

Since D is of the order of log(log|M|), this shows that (|T7l) is sharp (up to the 
constant). 



Proof. Let us start with the upper bound of ()20p . It is more convenient to use the 
following definition for 71: 

+00 

71 (Af, d) = inf sup V 2'd{x, M,) 

where the infimum is taken over every sequence {Mi)i^^ of subsets of M satisfying 
the cardinality condition \Mi\ < Ni for every i. It is well known (see [7]) that this 
definition coincides with the one with partitions, up to a universal factor. 
For < i < D let Si he the set of vertices of depth at most i. Using (|19p we obtain 
d{x, Si) < E ■ 2-*+3 for every x G M. Therefore 

+ 00 D 

sup^Td{x,S^) < E^T2-'+^ = 8E-{D + 1). 

j=0 i=0 

Besides, it is easily shown that 

\S,\ < N,+3- 

The sequence (5i)„gN does not quite satisfies the right cardinality condition, but this is 
not a big deal. If we shift the sequence by letting Mq — Mi — M2 — Sq and Mi = Sis 
for i > 3, we still have 

+00 

sup 2'd{x, Mi) <C ■ E-D 



for some universal C, which proves the upper bound of pop . 

To prove the lower bound we need to show that the previous sequence of approximations 
is essentially optimal. Let (Afi)i>o be a sequence of subsets of M satisfying \Mi\ < Ni 
for every i. A vertex x of depth i < D — 1 has Ni + 1 children. So at least one them, 
call it y, has the following property: neither y nor any of its offsprings belong to Mi. 
Using this observation, we can construct inductively a sequence Xo,Xi, . . . ,xd, where 
Xq is the root of M and such that 
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• XiJf-i is a child of Xi, 



• neither Xi-^-i nor any of its offsprings belong to Mi, 

for every i < D — I. Let i < D — 1 and let x £ Mi. Since x is not an offspring of Xi+i 
we have d{x,XD) > E ■ 2"'+!. Thus 

CX3 D-1 

^rd{xD,Mi) >eJ2 2'2-'+i ^2E-D, 
which proves the lower bound of (|20p . 

Inequality (|21l) is proved exactly the same way. □ 
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